Abstract


We examine the role of strategic motivations in mediating the relationship between underlying political 
preferences and vote choice, in a multiparty, single member, simple plurality system, and examine the role 
of constituency context in determining the scope for strategic voting. Political preference data from the 
British Election Panel Survey, 1997—2001, were modelled with mixed multinomial logit models. Latent 
variables were used to model the stable party political traits underlying observed preferences, allowing 
correlation between choices and so avoiding the restrictive assumption of independence from irrelevant 
alternatives. Ranked approval ratings were used to characterize the underlying political preferences in 
the presence of insincere voting. From these models we estimate that approximately 9% of votes cast 
may have been affected by strategic factors. In keeping with ‘Duverger’s law’, the smallest of the three
main parties, the Liberal Democrats, were found to be most affected by strategic voting. 
© 2006 Elsevier Ltd. All rights reserved. 
Introduction


There is, nowadays, widespread recognition that local context plays an important part in the 
voting decision (e.g. Agnew, 1987; Cox, 1969; Johnston, Pattie, & Allsopp, 1988). Numerous 


* Corresponding author. Tel.: +44 161 275 4721; fax: +44 161 275 4722. 
E-mail addresses: e.fieldhouse@manchester.ac.uk (E. Fieldhouse), n.shryane@manchester.ac.uk (N. Shryane), 
andrew.pickles@manchester.ac.uk (A. Pickles).
1 Tel.: +44 161 275 4738.
2 Tel.: +44 161 275 5204.


0962-6298/$ - see front matter © 2006 Elsevier Ltd. All rights reserved. 
doi: 10.1016/j.polgeo.2006.09.005 


E. Fieldhouse et al. / Political Geography 26 (2007) 159–178


explanations have been put forward for the persistent clustering of like-minded voters in 
geographical space. Many such explanations can be traced back to Cox (1969) and include ac- 
quaintance circle bias or social interaction effects (e.g. Huckfeldt & Sprague, 1995; Miller, 
1977); environmental observation or common experiences (e.g. Pattie & Johnston, 1995); 
and party mobilisation (e.g. Pattie, Johnston, & Fieldhouse, 1995). Others have argued that 
so-called neighbourhood effects are the artefact of unmeasured differences in composition 
(e.g. McAllister & Studlar, 1992). At the same time, strategic, non-sincere or tactical voting 
has become widely recognised as being increasingly influential in modern elections, especially 
in single member plurality systems where there are more than two effective competitors (e.g. 
Great Britain). Whilst there is a substantial literature on strategic voting and its geographical 
properties (see below), strategic voting has not been widely discussed as a driver of geograph- 
ical clustering in voting. We argue that strategic voting is of particular interest to political 
geographers because it is one of the most tangible ways in which local (constituency) context 
can directly affect the decisions of rational voters on polling day. 

A significant impediment to understanding how strategic behaviour affects the geography of
voting (and vice versa) is that there is no clear consensus on how it should be measured, nor on 
the extent to which subjective notions of strategic voting are consistent with the objective 
conditions under which such behaviour should (rationally) take place. Blais and Nadeau 
(1996) and Alvarez and Nagler (2000) distinguish three approaches to identifying strategic voting,
based on (i) using aggregate data (e.g. Johnston & Pattie, 1991; Spafford, 1972); (ii) self- 
reported strategic voting (e.g. Evans & Heath, 1993; Niemi, Whitten, & Franklin, 1992); and 
(iii) divergence between stated preferences and vote (e.g. Black, 1978; Blais & Nadeau, 
1996). Recognising the potential weaknesses in each of these approaches, like Alvarez and
Nagler we adopt an approach based on the modelling of discrepancies between stated prefer- 
ences and reported behaviour, whilst taking into account the objective strategic context (cf. 
Cain, 1978). Our modelling approach, which utilizes repeated response data to identify latent
preference classes in mixed multinomial logit (MMNL) models, nevertheless differs substantially
from theirs and is, we argue, more likely to provide reliable estimates. We draw upon data from the British Election
Panel Study for the 1997 and 2001 General Elections in order to examine the extent of 
‘non-preference voting’, broadly defined, and strategic voting, both subjectively and objectively
defined. The approach both allows us to better represent preferences and voting for third or 
minor parties in a multiparty system, in this instance the British Liberal Democrats, and pro- 
vides a more general framework for modelling party preference and voting behaviour. 


Preferences and votes 


It has been argued previously that preference and vote are not equivalent and that voters are 
more likely to betray their preferences where the objective conditions dictate (Black, 1978; 
Cain, 1978). In other words voting contrary to one’s stated party preferences is related to the 
electoral context, or more precisely in the U.K., to the strategic situation in the constituency 
as this will shape the likelihood that a vote for a preferred candidate will be influential in 
the final outcome of the particular election or district. The rationale for this is straightforward: 
should a voter perceive a vote cast for their preferred candidate to be most likely wasted, that 
voter may opt to vote for another party. This is equally true in U.S. presidential elections where 
strategic voting may depend on the viability of the preferred candidate (e.g. Abramson, Aldrich, 
Paolino, & Rohde, 1992). This is consistent with a rational choice approach and decision the- 
ory: McKelvey and Ordeshook and others have shown that a rational voter may vote against his 
or her most preferred party where it has little or no chance of winning in a multiparty election
(Cain, 1978; McKelvey & Ordeshook, 1972). This can also be considered ‘Duvergian’ tactical 
voting (Fisher, 2004) as it is consistent with Duverger’s law which states that third or lower- 
placed parties are deserted by rational voters in a single member simple plurality system (Riker, 
1982; Cox, 1997; Duverger, 1954). We would expect that strategic voting is more likely to affect
smaller parties, as they are less likely to be considered viable, and parties in the political
centre since (if preferences are single-peaked) they are more likely to be the second preference 
of voters on both the left and the right. 

The Liberal Democrats in Britain represent an archetypal example of a third party in a predominantly
two-party system, elected under a single member simple plurality scheme. In terms
of the well-established left—right spectrum of political parties, they represent the centre party, 
outflanked by the Conservatives on the right and the Labour Party on the left, though by 2001 
many commentators were arguing that the Liberal Democrats were to the left of Labour (Bara, 
2001; see also Russell & Fieldhouse, 2005 for discussion). Added to this, they have the least 
well-defined social constituency of support (see Russell, Fieldhouse, & MacAllister, 2002). 
Consistent with Duverger’s law, the Liberal Democrat vote is particularly sensitive to tactical 
or strategic voting since, as the centre party, the Liberal Democrats are often regarded by the 
supporters of the major parties as their second choice; and their own supporters may be espe- 
cially likely to vote strategically for another party because of the probability of a wasted vote or 
third party ‘squeeze’ (see Cain, 1978; Spafford, 1972). This combination of factors has meant 
that Liberal Democrat voting is relatively ‘soft’ and notoriously hard to explain (see Russell & 
Fieldhouse, 2005). 

Perhaps because of this difficulty, much research into recent British elections has tended to 
focus on the vote share between the two major parties (Labour and Conservative). The third 
party has often been ignored or cast as an influence on the other parties rather than as a straightforward
alternative in its own right. This is despite support for the Liberal Democrats being
quite substantial, with around 20% of the vote share in General Elections since the 1980s.
This neglect is due, in no small part, to the relative simplicity of, and familiarity with, methodological
tools for binary choices (especially logistic regression). However, this type of approach does 
not deal with the choices that voters actually face and has been found to misestimate vote share 
between the larger parties in part because of the influence of the overlooked smaller party 
(Heath, Jowell, Curtice, Field, & Witherspoon, 1991). More realistic modelling approaches 
must deal with the polytomous choice presented in multiparty elections. 

Multinomial logit (MNL) allows for the modelling of discrete choices with more than two 
alternative parties (e.g. Whitten & Palmer, 1996). However, it has been criticized because of its 
restrictive assumption of “independence from irrelevant alternatives” (IIA). With IIA the relative
probability of choosing between any pair of alternatives must not depend on the presence
(or absence) of other alternatives in the choice set. In effect, this assumption implies that voters 
must not view any parties as being clustered together, in the sense that the parties in the cluster 
could be seen as substitutes. This is patently not the case in British party politics, where the 
Liberal Democrats are currently seen as a closer substitute for the Labour Party than for the 
Conservative Party (Russell & Fieldhouse, 2005). 
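
The IIA property can be made concrete with a small numerical sketch. The utilities below are hypothetical values chosen only for illustration, not estimates from this paper. Under a standard MNL, the Labour-to-Conservative odds are the same whether or not the Liberal Democrats are in the choice set.

```python
import math

def mnl_probs(utilities):
    """Multinomial logit choice probabilities from systematic utilities."""
    top = max(utilities.values())  # subtract the maximum for numerical stability
    expu = {party: math.exp(u - top) for party, u in utilities.items()}
    total = sum(expu.values())
    return {party: e / total for party, e in expu.items()}

# Hypothetical utilities (Conservatives as the reference, fixed at zero).
utilities = {"Con": 0.0, "Lab": 0.8, "LD": 0.5}

full = mnl_probs(utilities)
without_ld = mnl_probs({p: u for p, u in utilities.items() if p != "LD"})

# Under IIA the odds of Labour versus Conservative ignore the third option.
ratio_full = full["Lab"] / full["Con"]
ratio_reduced = without_ld["Lab"] / without_ld["Con"]
print(ratio_full, ratio_reduced)
```

Both ratios equal exp(0.8), so removing the Liberal Democrats draws votes from Labour and the Conservatives in exact proportion to their existing shares, which is the pattern the text argues is implausible for British party politics.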

Other authors have used multinomial probit (MNP) models, which allow for (limited) 
specification of a covariance structure for the disturbances and so do not impose the IIA 
assumption (e.g. Alvarez & Nagler, 1995; Alvarez, Nagler, & Bowler, 2000). A recent review 
(Dow & Endersby, 2004) found little difference between MNL and MNP estimates when 
modelling the same UK multiparty election data, however. 
We use an elaboration of the basic MNL model that includes choice-specific random effects,
referred to as a mixed multinomial logit (MMNL) model. The advantage of MMNL is its flex- 
ibility; any discrete choice model consistent with random utility maximization can be approx- 
imated by an MMNL model (McFadden & Train, 2000). This is not the case for the standard 
MNP, where, for example, non-normally distributed random terms cannot be accommodated 
(Hensher & Green, 2003). This is of significance in this paper, where, because of non-normally 
distributed latent party preferences, we present a model with discrete distributions for the ran- 
dom terms to estimate latent classes of political preference. 


Preference and vote in the 1997 and 2001 General Elections in England 


Data from two waves of the British Election Panel Study 1997—2001 (BEPS-2; Thomson, 
Park, & Brook, 1999) were used. These waves, part of the 30-year ongoing British Election 
Studies, were conducted shortly after the 1997 and 2001 General Elections. Although the sur- 
vey was also conducted in Wales and Scotland, only respondents living in England were 
utilized as they represented the largest group with the same choice set of three parties in 
each constituency — Conservative, Labour and Liberal Democrat. 

There were 2551 respondents in the panel in the 1997 wave and 1709 remaining in the 2001 
wave. Only data from waves where individuals had a clear party of first preference and reported 
voting for one of the three major parties were retained. (Support for parties other than the main 
three was very limited; only 69 respondents reported voting for a different party, and an addi- 
tional 17 said that they supported other parties even though they voted for a main party.) Fur- 
ther, 75 respondents with missing covariates were discarded. This resulted in 1914 respondents 
contributing matched preferences and votes for 2892 voting occasions. 

To obtain a measure of party preference we constructed a ranking based upon party approval 
ratings and party identification. Rankings were preferred to using the scale responses because 
no assumptions about the underlying properties of the scale were needed other than ordinality. 
Respondents were asked to rate their current approval of each of the main parties on a 5-point 
scale from “‘strongly disapprove” to “‘strongly approve’’. These ratings were then ranked to 
give a preference ordering. Because of the limited response options with a 5-point scale, tied
rankings were common (in about one-third of rankings). Although tied ranks may have indicated
genuine indifference on the part of the voters, other questions in the BEPS-2 survey proved use- 
ful in differentiating first and second choice for most respondents with tied preferences. Fisher 
(2004) argued for using reported vote to make this distinction, unless the voter had reported 
voting strategically. However, there are two problems with this: first there is ambiguity associ- 
ated with self-report measures of strategic motivation (e.g. see Blais & Nadeau, 1996; Niemi, 
Whitten, & Franklin, 1993); second we require a measure of preference independent of vote, as 
vote choice is included elsewhere in the model. The BEPS/BES measure of party identification 
was considered appropriate for this purpose, as it has been shown to tap the underlying concept 
of current party preference or ‘valenced partisanship’ in the U.K. (Brynin & Sanders, 1997; 
Clarke, Sanders, Stewart, & Whiteley, 2004), as opposed to the long-standing affiliation it 
has been found to represent in the USA (e.g. Campbell, Converse, Miller, & Stokes, 1964; 
Green, Palmquist, & Schickler, 2004). In virtually all cases party identification was consistent 
with the respondent’s approval ratings. Where approval ratings were tied for first place,
therefore, party identification was used to decide the preference ranking. Of the final sample, a full
rank ordering of the parties was derived for 76%, with 24% having a clear first preference but 
tied preference for second place. 
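
The ranking procedure described above can be sketched as follows. This is one plausible reading of the text, not the authors' code: the function name, the data layout and the handling of respondents with no usable party identification are all assumptions made for illustration.

```python
def rank_parties(approval, party_id=None):
    """Turn 5-point approval ratings into a preference ordering.

    approval: dict mapping party -> rating (higher means more approval).
    party_id: party the respondent identifies with, or None.
    Returns a list of groups, best first; parties sharing a group are tied.
    Returns None when no clear first preference can be established.
    """
    # Group parties by rating, highest rating first.
    levels = sorted(set(approval.values()), reverse=True)
    groups = [sorted(p for p, r in approval.items() if r == lvl) for lvl in levels]
    top = groups[0]
    if len(top) > 1:
        # Tie for first place: party identification breaks it, if it can.
        if party_id not in top:
            return None
        rest = [p for p in top if p != party_id]
        groups = [[party_id], rest] + groups[1:]
    return groups

# Tie for first place resolved by party identification: full ordering.
print(rank_parties({"Con": 2, "Lab": 4, "LD": 4}, party_id="LD"))
# Clear first preference, tie for second place left as a tie.
print(rank_parties({"Con": 5, "Lab": 2, "LD": 2}))
```

Ties for second place are deliberately left unresolved, matching the 24% of the sample described as having a clear first preference but a tied second preference.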
The structure of the ranked party preferences is illustrated in Fig. 1 in a graphical version of
Kendall’s (1938) τ correlation coefficient (Schulman, 1979; Zhang, 2004). Each full rank ordering
of the three parties (3! = 6) was assigned a ‘spoke’ and the proximity of the spokes reflects
the distances between the rank orderings so that the closest are adjacent and the most distant
(τ = −1) are opposite to one another across the origin. ‘Opposite’ spokes have inverse rank
orderings and so represent ‘major’ axes of relative preference between pairs of parties. In
Fig. 1 the Labour–Conservative axis is labelled (1), the Liberal Democrat–Conservative
axis is (2) and the Labour–Liberal Democrat axis is labelled (3).
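
The distances underlying the spoke layout can be checked with a short sketch that computes Kendall's τ between full rank orderings of the three parties. The function below is a plain implementation for untied orderings, written for illustration rather than taken from the paper.

```python
from itertools import combinations, permutations

PARTIES = ("Con", "Lab", "LD")

def kendall_tau(order_a, order_b):
    """Kendall's tau between two full (untied) orderings of the same items."""
    pos_a = {p: i for i, p in enumerate(order_a)}
    pos_b = {p: i for i, p in enumerate(order_b)}
    concordant = discordant = 0
    for x, y in combinations(order_a, 2):
        # A pair is concordant when both orderings rank it the same way round.
        if (pos_a[x] - pos_a[y]) * (pos_b[x] - pos_b[y]) > 0:
            concordant += 1
        else:
            discordant += 1
    return (concordant - discordant) / (concordant + discordant)

orderings = list(permutations(PARTIES))  # the 3! = 6 spokes
reference = ("Lab", "LD", "Con")
for other in orderings:
    print(reference, other, round(kendall_tau(reference, other), 2))
```

With three parties τ takes only the values 1, 1/3, −1/3 and −1. An ordering and its exact reverse give τ = −1, which is why inverse orderings sit on opposite spokes.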

The six wedge-shaped sectors between the axes each have the same party ranked either first 
or last on their two defining axes, and so each sector represents valence towards or against 
a party. Each sector is labelled with the relevant party name and a plus sign to indicate relative 
preference and a minus sign to indicate relative aversion. For example, the bottom-right sector 
is labelled “LD+”, because the Liberal Democrats are ranked first on both of the axes that
define the sector. 

For each wave separately the percentage of respondents with each ranking was plotted on the 
appropriate axis (with zero at the ‘hub’ and 50% at the ‘rim’) and the points joined to form 
a polygon. Of all the rankings 24% had tied second preferences and were therefore identified 
uniquely only to a sector, not to an axis. These preferences were allocated to axes in proportion 
to the ratio of fully ranked preferences between the axes that constituted the sector.³ Fig. 1
shows that the pattern of preferences changed little from 1997 to 2001, with slight gains for 
the Conservatives and the Liberal Democrats at the expense of Labour. 
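
The proportional allocation of tied second preferences is simple arithmetic. The sketch below reproduces the 1997 example given in the footnote (12% of rankings tied, full rankings in the ratio 7:3); the function name is hypothetical.

```python
def allocate_tied(tied_share, full_share_a, full_share_b):
    """Split a tied-preference share between two axes in proportion
    to the fully ranked shares already observed on those axes."""
    total = full_share_a + full_share_b
    return (tied_share * full_share_a / total,
            tied_share * full_share_b / total)

# 12% of 1997 rankings were "Con, (Lab, LD)"; full rankings split 7:3.
con_ld_lab, con_lab_ld = allocate_tied(12.0, 7.0, 3.0)
print(con_ld_lab, con_lab_ld)
```

The two allocated shares sum to the original tied share, so the polygon in Fig. 1 still accounts for 100% of rankings in each wave.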

Considering first the major axes, the Labour—Conservative axis (1) was the most significant, 
containing 70% of preferences overall, split 2:1 between Labour and Conservative preferrers, 
respectively. The Liberal Democrat—Conservative axis (2) accounted for 24% of the prefer- 
ences, again split approximately 2:1 against the Conservatives. The Labour—Liberal Democrat 
axis was relatively trivial, with only 6% of respondents with preferences on this axis. 

In terms of sectors, the Lab+ sector contained 49% of all preferences, Con+ 32% and LD+ 
19%. However, the Con— sector was the largest overall, accounting for 62% of preferences. 
Combining both Con+ and Con— sectors together accounts for fully 94% of preferences; 
this, coupled with the limited extent of the Labour—Liberal Democrat axis, reflects the notion 
that the Liberal Democrats and Labour are not seen as independent alternatives but are part of 
an anti-Conservative bloc. 

In terms of a single ideological left—right political dimension, the Conservatives are posited 
to be on the right and Labour on the left, with the Liberal Democrats in between (although 
closer to Labour, as suggested above). The above rankings provide good evidence in favour 
of this predominantly uni-dimensional interpretation of the preferences; the rankings that are
not single-peaked on this continuum (“Lab, Con, LD” and “Con, Lab, LD”, i.e. the LD− sector)
account for only about 7% of the rankings overall.

In the sectors with positive party valence the “V” boxes show the percentage of respondents
who reported voting for their preferred party. Over 90% of both Conservative and Labour pre- 
ferrers said that they voted with their party preference, compared to only 78% of Liberal Dem- 
ocrat preferrers, demonstrating the increase in non-preference voting for preferrers of the 


3 For example, of all the 1997 rankings, 12% were “Con, (Lab, LD)”, i.e. Conservatives ranked first and the other two
parties tied. These preferences were allocated to “Con, LD, Lab” and “Con, Lab, LD” in the ratio of full rankings
between these axes, 7:3 in this case. That meant 8.4% were plotted on the “Con, LD, Lab” axis and 3.6% on the
“Con, Lab, LD” axis.
Fig. 1. Percentages of party preference rank-orders for BEPS-2 waves 1997 and 2001, arranged by
correlation (Kendall’s τ) of ranks. Percentage of vote for most preferred party (V) also shown.


“third” party. Overall, across both waves, Labour attracted 47% of the reported vote, the Conservatives
33% and the Liberal Democrats 20%. Given that Liberal Democrat supporters were
much less likely to vote for their preferred party than were other party preferrers, the fact that 
their vote share was slightly greater than their overall level of first preferences suggests that the 
Liberal Democrats gained votes from other parties’ supporters at least as much as they lost 
votes to other parties from their own supporters (see Russell & Fieldhouse, 2005). 


Modelling framework 


The structure of these party preferences and their relationship to voting was specified as an 
MMNL model, following Skrondal and Rabe-Hesketh (2003) (cf. Glasgow, 2001). One of the 
parties, here the Conservatives, is picked as a reference and the fixed parameters represent the log 
of the odds of choosing (voting for or ranking higher) each of the other parties compared with the
reference party. As well as the fixed parameters, two random variables modelling the correlation
among choices (Conservative–Labour and Conservative–Liberal Democrat) across individuals
were also estimated, thereby relaxing the IIA assumption. The random variables are latent, or
unobserved, and must be specified with a given distribution. Based on Fig. 1 we hypothesized
that these latent variables, reflecting correlations among choices, would most likely be multimodal 
rather than normally distributed, with distinct patterns of preference for each partisan block. 
Therefore the latent variables were specified with discrete distributions, i.e. the existence of a finite 
number of latent classes of preference rather than continuously varying preferences. More detail 
on the model framework is found in Appendix A. 
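
As a rough illustration of what a discrete latent variable distribution implies, the sketch below computes marginal choice probabilities for a latent-class multinomial logit: a weighted sum, over classes, of ordinary MNL probabilities evaluated at each class location. It is not the specification fitted in this paper (which is set out in Appendix A and also models rankings and repeated occasions), and every number in it is hypothetical.

```python
import math

def mnl_probs(utilities):
    """Multinomial logit probabilities from systematic utilities."""
    top = max(utilities.values())
    expu = {party: math.exp(u - top) for party, u in utilities.items()}
    total = sum(expu.values())
    return {party: e / total for party, e in expu.items()}

def latent_class_mnl(fixed, classes):
    """Marginal probabilities: sum over classes of pi_c * MNL(fixed + location_c).

    fixed:   log odds versus the reference party (Con) for each other party.
    classes: list of (class_probability, {party: class_location}).
    """
    marginal = {"Con": 0.0, **{party: 0.0 for party in fixed}}
    for pi, location in classes:
        utilities = {"Con": 0.0}  # reference party utility fixed at zero
        for party, beta in fixed.items():
            utilities[party] = beta + location[party]
        for party, prob in mnl_probs(utilities).items():
            marginal[party] += pi * prob
    return marginal

# All values below are hypothetical and chosen only for illustration.
fixed = {"Lab": 0.5, "LD": 0.2}
classes = [
    (0.40, {"Lab": 3.0, "LD": 1.5}),    # Labour-leaning class
    (0.20, {"Lab": 1.0, "LD": 2.5}),    # Liberal Democrat-leaning class
    (0.10, {"Lab": 0.0, "LD": 0.0}),    # class near the origin
    (0.30, {"Lab": -3.0, "LD": -2.0}),  # Conservative-leaning class
]
marginal = latent_class_mnl(fixed, classes)
print(marginal)
```

Because the Labour and Liberal Democrat locations move together across classes, the two choices are correlated in the population even though each class on its own follows a standard MNL.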

The models were fitted in gllamm (Rabe-Hesketh, Pickles, & Skrondal, 2001) using maximum
likelihood estimation. BEPS is a multistage sample with a complex design involving
stratification and clustering. To allow for this, robust standard errors are reported, based
upon the Huber/White ‘sandwich’ estimator (White, 1982), which adjusts for the clustering and
therefore non-independence of individuals within constituencies.
Two equations were specified quantifying the probability of preferring either the Labour
Party or the Liberal Democrats in comparison to the Conservative Party. The data were from 
two occasions (1997 and 2001), nested within individuals. A dummy variable, VOTE, equal 
to 1 for voting responses and 0 otherwise, was defined to provide a contrast with overall pref- 
erence. The structure of the models is illustrated in Fig. 2. 

As stated earlier the models were fitted with discrete latent variables, which means that 
rather than assuming a normal distribution for the latent party preferences, we assumed (based 
upon Fig. 1) that there would be a multimodal distribution. Models with discrete latent variables 
are also known as latent class models, in which individuals are assumed to belong to one of C 
discrete classes where C represents the number of points in the discrete distribution. Models 
with 2—5 latent classes were estimated and the best-fitting, most parsimonious solution, as 
indicated by the Bayesian Information Criterion (BIC) (Schwarz, 1978; see Vermunt & 
Magidson, 2002) was the one with four classes. The four class solution provided a huge 
improvement in model fit (based on the BIC) compared to a further comparison model fitted 
with normally distributed latent variables, validating the choice of latent classes over normally 
distributed latent variables. 
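
For readers unfamiliar with the criterion, the BIC is −2 × log-likelihood plus the number of parameters multiplied by the log of the sample size, with lower values indicating a better trade-off between fit and complexity. The sketch below applies this to the log-likelihoods and parameter counts reported in Table 1. Taking the sample size to be the 1914 voters is an assumption on our part; it is not stated explicitly, but it appears to reproduce the reported BIC values.

```python
import math

def bic(log_likelihood, n_params, n_obs):
    """Bayesian Information Criterion (Schwarz, 1978); lower is better."""
    return -2.0 * log_likelihood + n_params * math.log(n_obs)

# (log-likelihood, number of parameters) as reported in Table 1.
table1 = {
    "Model 1": (-4545.45, 17),
    "Model 2": (-4517.43, 19),
    "Model 3": (-4447.30, 23),
    "Model 4": (-4435.50, 25),
}
N_VOTERS = 1914  # assumption: sample size taken as the number of voters

bic_values = {name: bic(ll, k, N_VOTERS) for name, (ll, k) in table1.items()}
for name, value in bic_values.items():
    print(name, round(value, 2))
```

The same function can be used to compare solutions with different numbers of latent classes, which is how the four-class solution was preferred here.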

The four latent classes can be interpreted as representing groups of individuals homogeneous 
in their unobserved political party traits. Fig. 3 shows the locations and probabilities of the 
latent classes. The x-axis represents Labour—Conservative trait variation, the y-axis Liberal 
Democrat—Conservative trait variation. To allow the congruent positioning of the Labour latent 
class on the left and the Conservative latent class on the right of the figure the class locations 
were reflected about the zero point on each axis, so relative Labour and Liberal Democrat preferences
are negative and Conservative are positive. This was purely for display purposes; the
arbitrary choice of the Conservatives as the reference party in the MMNL model meant that relative
Conservative preferences are actually negative in the models.




Fig. 2. Model 2. Mixed multinomial logit model of ranked preference and vote for the Labour (Lab) and Liberal
Democratic (LD) parties (in relation to the Conservative Party) in 1997 and 2001. Arrows denoting Lab effects are solid,
LD arrows are dashed. Larger circles represent latent party political traits for Lab (η¹) and LD (η²) (and their correlation,
ψ); smaller circles represent error in the fixed effects estimates (ε1–ε8; defined by the logistic distribution and not
estimated model parameters).

Fig. 3. Latent political party trait space from Model 1. The x-axis is the Lab—Con dimension, the y-axis the LD—Con 
dimension. The four latent classes are plotted in this space, labelled with plausible party/ideological affiliation (Lab = 
Labour, LD = Liberal Democrat, Switch = Party Switchers, Con = Conservative) and the percentage of respondents in 
the class. 


Respondents were assigned to the latent class to which they had the highest probability of 
membership.* The classes were labelled according to their locations in the latent space and 
the modal party or parties of first preference. The distribution of the classes lies close to an 
axis running (roughly) from bottom left to top right. This reflects the pattern seen in Fig. 1, 
a strong positive correlation between Labour and Liberal Democrat preferences and the nega- 
tive correlation of those parties’ preferences with Conservative preferences. A further compar- 
ison model was fitted to see if a simpler specification with just one latent variable, reflecting 
pro- vs. anti-Conservatism, would provide an adequate fit to the data. This model was substan- 
tially worse-fitting than the two-dimensional model, as indicated by the BIC statistic, which 
penalises models for over-parameterization, showing that a uni-dimensional characterization 
would be an over-simplification. 
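
Modal class assignment can be sketched with Bayes' rule: each respondent's posterior class probabilities are proportional to the class probability multiplied by the likelihood of that respondent's responses under the class, and the respondent is assigned to the class with the largest posterior. The class probabilities below are those reported for Model 1; the likelihood values are hypothetical, since they depend on the fitted model and the individual's data.

```python
def posterior_class_probs(prior, likelihood):
    """Posterior class membership probabilities via Bayes' rule."""
    joint = [p * l for p, l in zip(prior, likelihood)]
    total = sum(joint)
    return [j / total for j in joint]

def modal_class(prior, likelihood, labels):
    """Assign to the class with the largest posterior probability."""
    post = posterior_class_probs(prior, likelihood)
    best = max(range(len(post)), key=post.__getitem__)
    return labels[best], post

labels = ["Lab", "LD", "Switch", "Con"]
prior = [0.41, 0.22, 0.06, 0.31]       # class probabilities, Model 1
likelihood = [0.02, 0.30, 0.10, 0.01]  # hypothetical, for one respondent

assigned, post = modal_class(prior, likelihood, labels)
print(assigned, [round(p, 3) for p in post])
```

In this example the respondent is assigned to the Liberal Democrat class even though Labour has the larger prior, because the responses are far more likely under the Liberal Democrat class.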

In keeping with the distribution of party identifiers during the period, the Labour and Con- 
servative classes were the largest, accounting for 41% and 31% of the respondents, respectively. 
Over 90% of the participants assigned to these classes were consistent in their party preference 
(when data were available for both years) and reported voting for their latent-class party. The 
third largest class, representing 22% of the sample, was Liberal Democrat. However, only 65% 
of those observed in both years consistently reported the Liberal Democrats as their top-ranked 
party, confirming Liberal Democrat support as more ephemeral than that of the other parties. 


* Class assignment was achieved using Empirical Bayes Modal scoring, the standard method in latent class modelling. 
This involves computing the posterior probabilities of class membership for each class, taking into account the modelled 
parameter estimates. The participant is then assigned to the latent class with the largest (modal) posterior probability. 
For technical details see Skrondal and Rabe-Hesketh (2004, p. 235). 
Eighty-four percent of votes in this class were for the Liberal Democrats, with virtually all of
the remainder going to Labour, indicating a possible strategic movement between the two
‘anti-Conservative’ parties.

Rather than a simple three-class solution (i.e. Labour, Conservative and Liberal Democrat 
classes) the best-fitting model contained four classes. The smallest class was labelled switchers 
and predominantly comprised respondents who changed allegiance to and from the Conserva- 
tives across elections. In this class the most common pattern of preference (amongst those with 
information for both waves) was for Labour in one wave and the Conservatives in the other 
(41% of individuals in the class). These included switchers in both directions in almost equal 
magnitude. The next most frequent preference was between the Conservatives and the Liberal 
Democrats (33%), again with switching direction split almost equally. Overall, 54% of votes in 
this class were for the Conservatives, with the remainder going 21% to Labour and 25% to the 
Liberal Democrats, with vote almost always accompanying preference. The class appears there- 
fore to represent those who wholeheartedly moved towards or away from the Conservatives 
over the period and changed their voting behaviour accordingly. 

Table 1 shows the full model estimates from Model 1 in the second column. Looking at the
fixed effects log odds-ratio estimates, both Labour and the Liberal Democrats are strongly pre- 
ferred overall to the Conservatives in 1997. However, the vote contrasts for both parties were 
negative, indicating that neither the Liberal Democrats nor Labour were winning as many votes 
as might have been expected from the distribution of expressed preferences. The preferences 
were broadly similar in 2001 but this time the vote contrast for the Liberal Democrats was 
not significant, i.e. compared to the Conservatives they were receiving about as many votes 
as their overall popularity would suggest. 

Model 1 indicated that around the time of the 1997 General Election, when the Conservative
Party, which had been in power for 18 years, was defeated by a Labour landslide, Labour and
Liberal Democrat supporters reported voting for their preferred party relatively less than did
Conservative supporters. This could have reflected disillusionment on the part of some Conservative
supporters who, while expressing dissatisfaction with their party (and therefore relative
satisfaction with the others), nonetheless went on to vote Conservative. Significantly for our
argument, it could also reflect the influence of strategic voting on the part of Labour and Liberal 
Democrat supporters who voted, not in favour of their preferred party, but to oust the 
Conservatives. 


Incorporating strategic voting 


Alternative approaches to estimating the effects of strategic voting were discussed earlier, 
each of which can be incorporated here. The discrepancy between preference and vote (e.g. 
Black, 1978) is inherent in the models already presented and is indicated by the VOTE con- 
trasts. The subjective approach classifies a voting decision as strategic if the respondent indi- 
cates this as their motivation for casting their vote (e.g. Evans & Heath, 1993). A dummy 
variable was therefore defined for those indicating a strategic motivation for their vote 
(STRAT = 1, or 0 otherwise as determined by the Evans and Heath measure). Strategic moti- 
vation was thought likely to affect the vote decision itself rather than the underlying political 
preferences, so STRAT was specified to act only on the voting response (i.e. VOTE × STRAT).
The effect of STRAT was not hypothesized to change over time, but it was hypothesized that 
the Liberal Democratic Party would be subject to strategic voting to a greater degree than the 
Table 1

Estimates for models with discrete latent variable distribution

Effect                      Model 1         Model 2         Model 3         Model 4
                             Est.    SE      Est.    SE      Est.    SE      Est.    SE

Fixed effects

Labour (vs. Con.)
  1997                       2.45**  0.26    1.69**  0.29    1.81**  0.33    [?]     0.34
  1997 Vote contrast        −0.70**  0.15   −0.55**  0.16   −0.06    0.18   −0.02    0.20
  2001                       2.21**  0.29    1.44**  0.29    1.52**  0.34    1.45**  0.35
  2001 Vote contrast        −0.35**  0.15   −0.11    0.17   −0.20    0.20   −0.13    0.23
  STRAT                      –       –      −1.53**  0.56    –       –      −1.17    0.74
  PVMᵃ                       –       –       –       –       0.27**  0.11    0.26    0.17
  THIRD                      –       –       –       –       0.15    0.75    0.51    0.60
  PVM × THIRDᵃ               –       –       –       –       0.71**  0.32    0.70**  0.26

LibDem (vs. Con.)
  1997                       1.23**  0.17    1.05**  0.25    0.82**  0.27    0.95**  0.32
  1997 Vote contrast        −0.54**  0.16   −0.80**  0.17    1.03**  0.28    0.68*   0.38
  2001                       1.40**  0.22    1.20**  0.26    0.92**  0.29    1.07**  0.36
  2001 Vote contrast        −0.18    0.17   −0.36**  0.16    0.99**  0.24    0.67*   0.37
  STRAT                      –       –       0.68    0.51    –       –       0.56    0.59
  PVMᵃ                       –       –       –       –       0.98**  0.18    0.87**  0.15
  THIRD                      –       –       –       –      −1.06**  0.38   −0.76    0.61
  PVM × THIRDᵃ               –       –       –       –      −0.40    0.30   −0.24    0.37

Random effects
  Trait Labour (η¹)          25.42           23.51           27.55           24.75
  Trait LD (η²)               9.02           12.99           13.51           12.93
  Correlation ψ1,2            0.92            0.91            0.93            0.93

Latent class probabilityᵇ
  Labour                      0.41            0.49            0.45            0.48
  Liberal Democrat            0.22            0.14            0.18            0.15
  Switchers                   0.06            0.09            0.09            0.09
  Conservative                0.31            0.28            0.28            0.28

Parameters                   17              19              23              25
Log-likelihood              −4545.45        −4517.43        −4447.30        −4435.50
BIC                          9219.37         9178.44         9068.41         9059.92

Notes. N = 1914 voters and N = 2892 voting occasions for all models. Significant effects: *p < 0.10, **p < 0.05.
ᵃ Effect of prior vote margin is per 10,000 votes.
ᵇ Maximum a posteriori probability based on Empirical Bayes estimation.


other parties (see Kim & Fording, 2001; Russell & Fieldhouse, 2005; Spafford, 1972). There- 
fore separate coefficients for STRAT were estimated for each party comparison (Model 2). 
The third approach was to examine the impact of the objective conditions for strategic voting 
(Model 3). This involved codifying this context in each constituency and then including this 
information into the model, to see the extent to which it could account for the discrepancies 
between preferences and votes (cf. Alvarez & Nagler, 2000). The concept of distance from 
contention (DFC; Cain, 1978; Niemi et al., 1992) was used to define the election context. 
DFC represents the relative strength of the parties in each electoral district based upon their 
previous election performance (here, the 1992 election for the 1997 wave and the 1997 election 
for the 2001 wave). DFC is computed by subtracting the number of votes polled by the 
second-placed party from the number of votes polled by each party in turn. So, if a party came 
first then DFC would be positive, if the party came second itself then DFC would be zero, and if 
the party came third it would be negative. 

Because the MMNL models used here compare each party to a reference alternative, the 
difference in DFC scores between each party and the reference was needed. The relative 
DFC between two parties, (party 1 − second place) − (party 2 − second place), simplifies 
to (party 1 − party 2), i.e. the simple distance in votes between the parties, which we will label 
Prior Vote Margin (PVM). With the Conservatives as the reference party, PVM was therefore 
positive if the party polled ahead of the Conservatives in the previous election and negative 
if behind. PVM was divided by 10,000 to put it on a scale comparable with the vote/preference 
measures. The mean values of PVM for Labour and the Liberal Democrats were 0.0914 and 
−1.0716, respectively, i.e. per constituency, Labour polled around 1000 votes more, and the 
Liberal Democrats about 10,000 votes fewer, than the Conservatives, on average. 
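
The DFC and PVM definitions above can be sketched in a few lines of Python. This is only an illustration: the function names and the constituency vote counts are hypothetical and do not come from the survey data.

```python
def distance_from_contention(votes):
    """DFC per party: its votes minus the votes of the second-placed party."""
    second = sorted(votes.values(), reverse=True)[1]
    return {party: v - second for party, v in votes.items()}


def prior_vote_margin(votes, reference="Con", scale=10_000):
    """PVM per non-reference party: vote lead over the reference, per `scale` votes."""
    return {p: (v - votes[reference]) / scale
            for p, v in votes.items() if p != reference}


# Hypothetical prior-election result for one constituency (not real data).
prior = {"Con": 24_000, "Lab": 19_000, "LD": 9_000}
dfc = distance_from_contention(prior)   # positive for first place, 0 for second, negative for third
pvm = prior_vote_margin(prior)          # negative when behind the Conservatives

# The relative DFC against the reference equals the raw vote difference.
relative_dfc_lab = dfc["Lab"] - dfc["Con"]
```

In this made-up seat Labour came second, so its DFC is zero, while its PVM is negative because it trailed the Conservatives.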

PVM on its own did not capture the full electoral context because it did not encode the party 
positions; parties in two separate districts might have the same PVM but one might be in second 
place and the other third, say. According to many accounts of strategic voting within rational 
voter theory (e.g. Cain, 1978; McKelvey & Ordeshook, 1972) being in third place (or worse) is 
one of the most potent incentives for strategic voting. For this reason a dummy variable THIRD 
was defined, equal to 1 if the party was in third place in its electoral district and 0 otherwise. 
PVM, THIRD and their interaction were therefore election- and alternative-specific variables, as 
before loading only on the vote response. It was again hypothesized that the smaller Liberal 
Democrat Party would prove more sensitive to strategic conditions than the larger parties. To 
allow the coefficients of these alternative-specific predictors to vary by party, two dummy 
variables, lab and LD, were defined, equal to 1 if the alternative was the Labour or Liberal 
Democrat party, respectively, and 0 otherwise. 
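
A minimal sketch of how the three contextual covariates might be built for one party in one constituency is given below. The helper name `strategic_covariates` and both sets of vote counts are hypothetical; the two seats are chosen so that Labour has the same PVM in each but is placed second in one and third in the other, which is the case the THIRD dummy is meant to distinguish.

```python
def strategic_covariates(votes, party, reference="Con", scale=10_000):
    """Return PVM, THIRD and their product for `party` (illustrative helper)."""
    pvm = (votes[party] - votes[reference]) / scale
    ranked = sorted(votes, key=votes.get, reverse=True)
    third = 1 if ranked.index(party) == 2 else 0
    # The interaction is PVM when the party is third and zero otherwise.
    return {"PVM": pvm, "THIRD": third, "PVMxTHIRD": pvm if third else 0.0}


# Hypothetical prior-election results (not real data).
seat_a = {"Con": 24_000, "Lab": 19_000, "LD": 9_000}    # Labour second
seat_b = {"Con": 24_000, "Lab": 19_000, "LD": 21_000}   # Labour third

lab_a = strategic_covariates(seat_a, "Lab")
lab_b = strategic_covariates(seat_b, "Lab")
```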

A further model was estimated that combined Models 2 and 3 to give a model with all of the 
strategic variables STRAT, PVM, THIRD and PVM x THIRD, to assess the relative effects of 
self-reported vs. contextual measures of strategic voting (Model 4). 

Table 1 also shows the full results for Models 2–4. The addition of the self-described 
strategic voting dummy variable STRAT gave Model 2 an improved fit over Model 1, BICs of 
9178.44 and 9219.37, respectively. The pattern of fixed effects was virtually the same over 
both models, with Labour and the Liberal Democrats being preferred overall to the Conserva- 
tives in both election waves but having lower vote contrasts (i.e. poorer conversion). The effect 
of STRAT was positive for the Liberal Democrats and negative for Labour, although significant 
only in the latter case. This suggested a net vote transfer away from Labour by self-reported 
tactical voters. This interpretation is supported by the rise in Labour’s latent class probability 
from 0.41 to 0.49; it seemed that some strategically voting Labour supporters were misclassi- 
fied in Model 1, which did not have the benefit of strategic voting information. The Liberal 
Democrat latent class probability dropped to 0.14 (from 0.22) indicating that a number of Lib- 
eral Democrat voters were also misclassified. There was also a slight reduction in the Conser- 
vative latent class probability from 0.31 to 0.28 indicating that some Conservative supporters 
voted tactically for the Liberal Democrats (see also Russell & Fieldhouse, 2005). The results 
from Model 2 are consistent with the pattern of strategic voting that would be expected based 
upon the hypothesized positions of the parties in ideological space, providing evidence in sup- 
port of the subjective measure of strategic voting. 

Instead of self-reported strategic motivation, Model 3 included the effects of PVM and 
THIRD, and so was based upon contextual rather than individual information. This model 
provided a large improvement in fit (BIC of 9068.41) compared to both the naive Model 1 
(without any strategic voting information; BIC of 9219.37) and Model 2 (with self-reported 
strategic motivation; BIC of 9178.44). For both Labour and 
the Liberal Democrats the effect of PVM was significant and positive, i.e. the probability of 
voting for either party increased with their vote from the previous election, in districts where 
the parties were in first or second place at the prior election. The coefficient for the Liberal 
Democrats was significantly higher than that for Labour. It therefore seemed, as expected, 
that the Liberal Democrat vote was much more sensitive to local party strength than was the 
Labour vote. 

The effect of being third in the prior election did not significantly harm the voting prob- 
ability for the Labour Party. For the Liberal Democrats, however, being in third place signif- 
icantly reduced the voting probability, as would be expected. The effect of PVM x THIRD, 
i.e. the effect of PVM when a party is in third place, was significant and positive on the 
Labour vote. Because PVM is always negative for parties in third place this implied a reduc- 
tion in vote probability for Labour the greater this prior vote “distance” — i.e. sensitivity to 
the size of the vote deficit for Labour when in third place. The effect of PVM x THIRD was 
non-significant for the Liberal Democrats, however, implying that when the Liberal Demo- 
crats are in third place the distance they are from the other parties does not make any addi- 
tional difference for voting probability. This is consistent with the notion of Duvergian 
strategic voting. 
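
As a worked illustration of how these three terms combine, the contextual part of the linear predictor can be written out using the Model 3 estimates from Table 1. This assumes the usual additive coding of an interaction, which the text does not state explicitly, and the PVM value of −0.5 (5,000 votes behind the Conservatives) is hypothetical.

$$
\begin{aligned}
\text{context} &= \beta_{\mathrm{PVM}}\,\mathrm{PVM} + \beta_{\mathrm{THIRD}}\,\mathrm{THIRD} + \beta_{\mathrm{PVM \times THIRD}}\,(\mathrm{PVM} \times \mathrm{THIRD}) \\
\text{Labour, third place:}\quad & 0.27(-0.5) + 0.15 + 0.71(-0.5) = -0.34 \\
\text{Liberal Democrats, third place:}\quad & 0.98(-0.5) - 1.06 - 0.40(-0.5) = -1.35
\end{aligned}
$$

On this reading the slope on PVM for a third-placed party is 0.27 + 0.71 = 0.98 for Labour and 0.98 − 0.40 = 0.58 for the Liberal Democrats, while the Liberal Democrats also carry the THIRD shift of −1.06 regardless of the size of the deficit.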

Unlike STRAT, taking into account these contextual effects altered the pattern of coef- 
ficients for the overall preferences and votes quite radically. As before, both the Liberal 
Democrats and Labour were significantly preferred to the Conservatives in both election 
years. However, the vote contrast coefficients, which were negative in Models 1 and 2, 
were significant and positive for the Liberal Democrats. This suggested that after taking 
into account the relative strength of the parties at the prior election in the constituency, 
the Liberal Democrats were actually more likely to receive votes, given their overall level 
of preference, than were the Conservatives. 

Model 4 included STRAT, PVM and THIRD and again gave further improvement in model 
fit (BICs of 9059.92 vs. 9068.41 for Model 3). This suggested that the individual-level and 
contextual-level variables were providing somewhat independent sources of information on 
the discrepancy between preference and vote. The pattern of significant fixed effects in this 
model was similar to that found in Model 3. For the Labour effects the PVM coefficient was 
almost identical but no longer significant because of a larger standard error. The Liberal 
Democrat vote contrasts were still positive but only significant at the 0.10 level. STRAT was 
no longer significant for either party, nor was THIRD for the Liberal Democrats. The apparent 
dilution of the effect of these strategic variables when estimated together in the same model 
suggests that they share information, i.e. strategic voting occurs when the contextual factors 
militate for it. 

Finally, Model 5 (Tables 2 and 3) extended Model 4 to include demographic and socio- 
economic covariates all measured in 1997. These were dummy variables for: sex, age (three 
variables), social class (two variables), tertiary education and homeownership. The covariates 
were included into the model as predictors of the latent classes, having their effects on the 
preference and voting responses indirectly via the latent political party traits. The results of 
this model are displayed in Table 2. Model 5 again provided a significant improvement in 
log-likelihood (—4349.75) over the equivalent without covariates (Model 4; —4435.50). 
However, because Model 5 had nearly double the number of parameters (49) it had a worse 
BIC statistic, 9069.79 compared to 9059.92 for Model 4. 
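
The trade-off between log-likelihood and parameter count can be checked directly. The sketch below uses the standard Schwarz formula, BIC = −2 log L + k ln N, with the log-likelihoods and parameter counts reported for Models 1 to 5. Taking N as the 1914 voters (rather than the 2892 voting occasions) is an assumption, made here because it is the value that matches the reported BIC figures.

```python
import math


def bic(log_likelihood, n_params, n_obs):
    """Schwarz criterion: -2*logL + k*ln(N); lower values indicate a better trade-off."""
    return -2.0 * log_likelihood + n_params * math.log(n_obs)


# (log-likelihood, number of parameters) as reported for Models 1-5.
models = {
    1: (-4545.45, 17),
    2: (-4517.43, 19),
    3: (-4447.30, 23),
    4: (-4435.50, 25),
    5: (-4349.75, 49),
}
N_VOTERS = 1914  # assumption: N is the number of voters, not voting occasions

bics = {m: round(bic(ll, k, N_VOTERS), 2) for m, (ll, k) in models.items()}
best = min(bics, key=bics.get)  # model with the lowest BIC
```

Each extra parameter costs ln(1914), roughly 7.56 BIC points, so the 24 additional parameters of Model 5 outweigh its gain in log-likelihood.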
Table 2

Estimates for Model 5 with demographic covariates

Effect                  Lab−Con           LD−Con
                        Est.     SE       Est.     SE

Fixed effects
  1997                  −0.46    0.45     −0.69    0.38
  1997 Vote contrast    −0.03    0.21     0.69     0.30
  2001                  −0.73    0.45     −0.56    0.39
  2001 Vote contrast    −0.13    0.26     0.67     0.29
  STRAT                 −1.02    0.45     0.61     0.35
  PVM^a                 0.25     0.13     0.86     0.16
  THIRD                 0.47     0.59     −0.72    0.42
  PVM x THIRD           0.70     0.28     −0.20    0.29

Random effects
  Latent trait η        26.25             14.17
  Correlation η¹, η²    0.96

Parameters              49
Log-likelihood          −4349.75
BIC                     9069.79

Notes. N = 1914 voters and N = 2892 voting occasions for all models. Significant effects (p < 0.05) are shown in bold. 
^a Effect of prior vote margin is per 10,000 votes. 



Because the addition of the covariates did not improve model parsimony, for brevity only 
a summary of the results for the demographic covariates, as shown in Table 3, will be given 
here. The locations and probabilities of the latent classes were much as found previously 
(changes of the order of 1% in class probabilities), although the estimated correlation between 
the latent traits increased to 0.96. The effects of the covariates were as expected: being younger, 
working class and a non-homeowner increased the probability of being in the Labour and Lib- 
eral Democrat latent classes compared to being in the Conservative class. Having a tertiary 
qualification increased the probability of Liberal Democrat class membership. Sex had no effect 
on class membership. None of the covariates significantly predicted Switcher class membership 
over Conservative class membership, indicating the similarity of people allocated to these clas- 
ses. Overall, the results from the final model confirmed the stability of the latent classes and 
also the similarity of the pro- and anti-Conservatives in terms of demographic and socio-eco- 
nomic information. 


Estimating strategic vote 


Fig. 4 shows the relationship in the raw data between strategic motivations for vote and 
whether the vote was discrepant to the stated preference ranking, i.e. a non-preference vote. 

Taking the percentage of self-described strategic votes at face-value suggested that approx- 
imately 10% of votes were cast strategically. However, only 6% of votes were cast both against 
preference and declared to be strategic, whereas 15% were one or the other. Moreover, 4% of 
votes were declared strategic but cast according to stated preference, indicating some problem 
with the interpretation of the question. The majority (80%) of these “with-preference strategic 
voters” selected an alternative party when subsequently asked what their “really preferred 
party” was, however. 
Table 3
Covariate effects for Model 5

                            Latent class
                            Lab             LD              Switcher        Con
                            Est.    SE      Est.    SE      Est.    SE      Est.    SE

Latent class probability^a  0.48            0.15            0.09            0.28
Lab−Con (η¹) location       6.74    -       2.47    -       0.03    -       −5.09   -
LD−Con (η²) location        4.03    -       4.84    -       0.33    -       −3.88   -

Covariates
  FEMALE                    −0.07   0.12    0.03    0.16    −0.07   0.22    -       -
  AGE 18-30                 0.57    0.21    0.67    0.28    0.55    0.34    -       -
  AGE 31-45                 0.79    0.16    0.71    0.22    0.18    0.30    -       -
  AGE 46-60                 0.36    0.16    0.38    0.23    0.07    0.29    -       -
  AGE > 61                  -       -       -       -       -       -       -       -
  GHClass salariat          -       -       -       -       -       -       -       -
  GHClass intermediate      0.17    0.15    0.15    0.21    −0.06   0.27    -       -
  GHClass working           1.16    0.19    0.70    0.26    0.43    0.34    -       -
  TERQUAL                   0.21    0.16    0.79    0.21    0.48    0.28    -       -
  NOHOMEOWN                 0.80    0.16    0.44    0.21    0.16    0.30    -       -
  Class intercept           −0.40   0.18    −1.51   0.27    −1.18   0.31    -       -

Notes. N = 1914 voters and N = 2892 voting occasions for all models. Significant effects (p < 0.05) are shown in bold. 
^a Maximum a posteriori probability based on Empirical Bayes estimation. 



Almost half of the non-preference votes (5% of total votes) were not declared strategic; 
when asked their motivation for their vote about 70% of these voters endorsed the statement 
“I voted for the best party.” This appears highly inconsistent; it may have represented 
measurement error in the party approval rankings or more likely in the motivation for vote question 
(Niemi et al., 1993), given the ambiguity of the word “best” in the above response (‘sincerely’ 
best or ‘strategically’ best, for instance). 
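
The percentages quoted in this passage fit together by simple inclusion and exclusion, which the short check below makes explicit. The variable names are ours; the three input percentages are the ones reported in the text.

```python
# Percentages of all votes, as reported in the text.
both = 6             # declared strategic AND cast against stated preference
strategic_only = 4   # declared strategic but cast with stated preference
nonpref_only = 5     # cast against preference but not declared strategic

declared_strategic = both + strategic_only     # self-described strategic votes
non_preference = both + nonpref_only           # votes against first-ranked preference
either = both + strategic_only + nonpref_only  # strategic OR non-preference
neither = 100 - either                         # sincere, with-preference votes

# Share of non-preference votes that were not declared strategic.
undeclared_share = nonpref_only / non_preference
```

The narrow definition (both conditions) and the broad definition (either condition) give the lower and upper limits of 6% and 15%.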


Fig. 4. Percentage of voters cross-classified by self-reported strategic motivation and voting against first ranked 
preference (i.e. non-preference voting). Labelled regions: Sincere (85%), Strategic, Non-preference. 
The lower and upper limits on strategic voting suggested by these raw data lie between 6% 
(strategic and non-preference voters) and 15% (strategic or non-preference voters), in keeping 
with previous research (e.g. Clarke et al., 2004; Heath & Evans, 1994). It is clear that these raw 
data contain too much inconsistency and ambiguity to provide a more accurate estimate of 
strategic voting on their own (see also Blais & Nadeau, 1996). A more integrative approach was 
used by Alvarez and Nagler (2000) to provide model-based estimates of strategic voting. 
They examined how predictions of vote based upon their model of voting were influenced by 
removing strategically important covariates such as vote share. 

Here, we took the parameter estimates from the most parsimonious model, Model 4, as 
a starting point. Based on the model parameters a probability was computed for each 
person-election for the likelihood of a vote being cast for each party (as was done for latent 
class assignment, earlier). The party with the highest probability was considered as that indi- 
vidual’s predicted vote. A second predicted vote was also made, but this time based upon 
a restricted model with the STRAT, PVM, THIRD and PVM x THIRD covariates set to 
zero, i.e. with the influence of the strategic variables removed. The two predicted votes were 
then crosstabulated to show the number of votes that were predicted to be “cast” for different 
parties when incorporating strategic information or not. (An important precondition for this 
approach was that the underlying model of observed vote be plausible; Model 4 predicted 
91.1% of votes correctly, which was considered acceptable.) 
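
The two-prediction procedure described above can be sketched as follows. This is a simplified illustration, not the authors' estimation code: the `base` utilities stand in for everything in the model other than the strategic terms (intercepts, vote contrasts and latent traits) and are hypothetical, as is the voter's covariate profile. The strategic coefficients are the Model 4 estimates from Table 1.

```python
import math


def vote_probabilities(utilities):
    """Multinomial logit probabilities from a dict of party utilities."""
    top = max(utilities.values())  # subtract the maximum for numerical stability
    weights = {p: math.exp(u - top) for p, u in utilities.items()}
    total = sum(weights.values())
    return {p: w / total for p, w in weights.items()}


def predicted_vote(base, coefs, x, use_strategic=True):
    """Party with the highest probability; strategic terms are dropped if requested."""
    utilities = {"Con": 0.0}  # reference party
    for party, u in base.items():
        if use_strategic:
            u = u + sum(coefs[party][name] * x[party][name] for name in coefs[party])
        utilities[party] = u
    probs = vote_probabilities(utilities)
    return max(probs, key=probs.get)


# Model 4 strategic coefficients (Table 1).
coefs = {
    "Lab": {"STRAT": -1.17, "PVM": 0.26, "THIRD": 0.51, "PVMxTHIRD": 0.70},
    "LD": {"STRAT": 0.56, "PVM": 0.87, "THIRD": -0.76, "PVMxTHIRD": -0.24},
}
# Hypothetical non-strategic utilities and covariates for one voter-election.
base = {"Lab": 0.5, "LD": 0.9}
x = {
    "Lab": {"STRAT": 0, "PVM": -0.5, "THIRD": 0, "PVMxTHIRD": 0.0},
    "LD": {"STRAT": 0, "PVM": -1.5, "THIRD": 1, "PVMxTHIRD": -1.5},
}

full = predicted_vote(base, coefs, x, use_strategic=True)
restricted = predicted_vote(base, coefs, x, use_strategic=False)
```

For this made-up voter the restricted prediction is the Liberal Democrats and the full prediction is Labour, so the observation would fall in an off-diagonal cell of the crosstabulation.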

The crosstabulation of full vs. restricted model-predicted votes is shown in Table 4. The 
entries off the main diagonal were the ones classified differently when taking into account 
the ‘strategic’ variables or not. 

Of the predicted votes influenced by the strategic variables, the majority (191) was forecast 
for the Conservatives under the full model and for the Liberal Democrats under the restricted 
model. This was surprising as we would have expected that sensitivity to the strategic context 
would be manifest mostly between the Liberal Democrats and Labour. Further inspection 
revealed that each one of these 191 votes was cast by a voter allocated to the Switcher latent 
class. This class consisted of individuals who had preferred and voted for the Conservatives at 
least once across the two election years, but who had preferred and voted for another party in 
the other year. This led to their latent class being located somewhat centrally in latent space, at 
a point defined by the weighted average of the class members’ preferences and votes over the 
two election years. This point was closer to the location of the Conservative latent class than to 
the Labour latent class because of the preponderance of Conservative preferences and votes of 
its members. The limited influence of the latent trait for the Switcher class thus allowed the 
other covariates to exert a relatively greater influence than they did on the other latent classes, 
tipping the predicted vote more easily in the Conservative rather than the Labour direction. 


Table 4
Predicted vote based upon full and restricted^a Model 4

Full model-predicted vote    Restricted model^a predicted vote
                             Con         Lab          LD          Total (%)
Con                          845         0            191         1036 (36)
Lab                          0           1340         5           1345 (46)
LD                           0           69           442         511 (18)
Total (%)                    845 (29)    1409 (49)    638 (22)    2892 (100)

^a Vote predicted from parameters estimated for Model 4, but with STRAT, PVM, THIRD and PVM x THIRD 
covariates set to zero; 9.2% off-diagonal votes, i.e. those predictions affected by ‘strategic’ covariates. 

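
The summary figures attached to Table 4 can be reproduced from its cell counts. The short check below is illustrative only; the variable names are ours.

```python
parties = ["Con", "Lab", "LD"]
# Rows: full-model prediction; columns: restricted-model prediction (Table 4).
table = [
    [845, 0, 191],
    [0, 1340, 5],
    [0, 69, 442],
]

total = sum(sum(row) for row in table)
off_diagonal = total - sum(table[i][i] for i in range(len(parties)))
pct_changed = 100 * off_diagonal / total  # predictions altered by the strategic covariates

# Predicted vote shares (%) under each model.
full_share = {p: 100 * sum(table[i]) / total for i, p in enumerate(parties)}
restricted_share = {p: 100 * sum(row[j] for row in table) / total
                    for j, p in enumerate(parties)}
```

The Liberal Democrat share moves from about 22% under the restricted model to about 18% under the full model, which is the gap of roughly four percentage points discussed in the text.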
On a more general level, the interpretation of the latent trait for the Switcher class is one of 
latent instability of preference rather than just of latent party political preference; the election 
waves could have been modelled separately, but then the Switcher latent class would not have 
emerged, as the class members have consistent preferences and votes within individual elec- 
tions. A very promising approach, beyond the scope of this paper, would be to explicitly model 
the changing structure in the latent classes over time (cf. Vermunt, 2003). 

Overall, slightly more than 9%* of all votes were classified differently by the two models, 
which is a plausible estimate of the influence of strategic factors on vote. It is important to note 
that this estimate of strategic voters is based on a model which considers both the objective and 
subjective conditions for strategic voting. Insofar as there is an imperfect relationship between 
the two (as illustrated in Fig. 4), the subsequent estimate is higher than would be derived using 
either subjective or objective criteria alone. Indeed alternative specifications of this model 
indicate that if we restrict the analysis to the objective conditions for strategic voting (i.e. 
non-preference voting consistent with the constituency context) strategic voting in 1997 and 
2001 may have been as low as 5% of all votes cast. 

Using this broad-based definition of strategic voting, the Liberal Democrat vote was the most 
sensitive to the strategic conditions, as hypothesized. Indeed they were predicted to have 
22% of the vote by the restricted model, but only 18% with the full model (the lower figure 
mirroring their actual vote share over the two elections). This implies that the Liberal 
Democrats might have expected their vote share to have been 4 percentage points higher in the 
absence of strategic contextual factors and motivations, i.e. on a strategically ‘level’ playing field. 


Discussion 


Based on latent class models the Labour and Liberal Democrat classes were found to be 
distinct members of a predominantly centre-left bloc, with distinct patterns of preference 
but whose members seemed prepared to share votes strategically against the Conservatives. 
Although the Liberal Democrat class was found to exist between the two major parties in 
preference-space (and “nearer” to Labour than the Conservatives) it was found to be an over- 
simplification to place the parties on a single, left—right political continuum. Rather, the signif- 
icant minority who did not have “single-peaked” preferences meant that models with two 
(albeit highly correlated) dimensions of latent political traits were required. This multidimen- 
sionality was attenuated with the addition of strategic voting and demographic information into 
the models, however. 

A naive model of preference and vote that did not take the strategic local situation into 
account was found to be misspecified. Information regarding strategic context was found to 
be more informative of the discrepancy between preference and vote than was a self- 
reported measure of strategic voting. This could mean that the majority of non-preference 
voters did not consider themselves to be strategic voters, though this would suggest that the 
same factors that are assumed to influence strategic voting also strongly affect other types 
of non-preference voting (Niemi et al., 1993). Although the subjective measure produced 
results that were difficult to interpret and that may have misestimated the prevalence of 
strategic voting, it did improve model fit over and above the contribution of the purely 
contextual measures. However, these contextual measures, particularly the straightforward vote 
distance between the parties (PVM), although based on four-year-old data, provided the 
greatest impact on model fit. This suggests that the objective conditions for strategic voting 
provide a better guide to explaining counter-preference voting than subjective information 
about motivations for vote. 

* This figure (slightly more than 9% of votes classified differently by the two models) was virtually the same, 8.9%, 
when estimated using Model 5, the model with demographic covariates. 

Whilst it is important to acknowledge that not all non-preference voting need be strategic, as 
other non-strategic motivations may exist (e.g. protest voting), the clear association between 
non-preference voting and objective conditions for strategic voting suggests that even where 
voters cannot or will not articulate a strategic motivation when asked, voters are rejecting their 
preferred party where it has no chance of winning. This implies strategic voting may be more 
widespread than estimates based on subjective measures alone would suggest (see also Niemi 
et al., 1993). 

Comparing predicted votes based upon the full model and a model with no ‘strategic’ vari- 
ables suggested that 9% of the votes were sensitive to these variables. (This figure is an average 
based on all constituencies; because not all voters were in constituencies where their preferred 
party was not in contention the actual proportion of ‘sensitive’ votes will be higher, see Alvarez, 
Boehmke, & Nagler, 2006.) This provided an estimate of strategic voting that combines non- 
preference voting, self-reported motivation as well as the objective contextual conditions. The 
use of latent variables for the underlying party preferences allowed these multiple, sometimes 
conflicting, sources of data to be combined within a principled modelling framework. As we 
hypothesized, consistent with the Duvergian model of strategic voting, it was the smallest of 
the three main parties whose vote was found to be most sensitive to strategic factors, 
and which was the net loser from strategic voting. 

The broader implications of these findings are of relevance to political geography and more 
specifically electoral geography. Of key significance is the fact that objective measures of 
strategic voting are more predictive of counter preference behaviour than subjective measures, 
implying that strategic context may operate in ways which are not directly observable through 
self-reported measures of strategic voting. It may be, for example, that voting is indirectly influ- 
enced by strategic context through sub-conscious discounting of the utility of voting for (rather 
than expressing preference for) smaller parties; through the targeted mobilisation efforts of 
political parties (see Fieldhouse, Pattie, & Johnston, 1996); and through tactical abstention 
as well as through the conscious act of voting for a less preferred party. Furthermore the objec- 
tive conditions for tactical voting as described here work to the disadvantage of locally weaker 
parties, creating feedback effects whereby parties cumulatively gain through strategic voting in 
areas where they are strongest (see also Russell & Fieldhouse, 2005). If these findings can be 
replicated elsewhere they would suggest that spatial clustering of voting, at the constituency 
level at least, may be largely attributable to the accumulation of the actions of rational strategic 
voters. 


Appendix A. Specification of the mixed multinomial logit framework 


We used the random utility framework presented by Skrondal and Rabe-Hesketh (2003) to 
derive the MMNL model. In a random utility model, 


$$U^a = V^a + \varepsilon^a, \tag{1}$$


$U^a$ (here $a = 1, 2, 3$), the subjective utility of party $a$, is modelled as comprising two parts: $V^a$, 
the measured characteristics of the chooser or choice alternative, e.g. age of voter, age of 
candidate, and $\varepsilon^a$, a random component representing unmeasured idiosyncrasies. Party 1 is then 
chosen over party 2 if $U^1 > U^2$, or equivalently, 


$$U^1 - U^2 = V^1 - V^2 + (\varepsilon^1 - \varepsilon^2) > 0. \tag{2}$$


If $\varepsilon^a$ has a Gumbel distribution then the differences $(\varepsilon^1 - \varepsilon^2)$ have a logistic distribution 
(McFadden, 1973), and the probability that $U^1$ exceeds $U^2$ is 


$$\Pr(U^1 > U^2) = \frac{\exp(V^1)}{\exp(V^1) + \exp(V^2)}. \tag{3}$$


If $V^a$ is parameterized as a linear combination of subject-specific covariates $x$, and the 
coefficients for the third, reference alternative are set to 0 (for identification), the standard 
multinomial logit model results, e.g. for the probability of choosing alternative 1 compared to the 
reference alternative (here, alternative 3): 


$$\Pr(\text{choice} = 1) = \frac{\exp(\beta^1 x)}{1 + \exp(\beta^1 x) + \exp(\beta^2 x)}. \tag{4}$$
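
A minimal numerical sketch of the multinomial logit probabilities with a zero-utility reference alternative is given below. The covariate vector and coefficient values are hypothetical and serve only to show the computation.

```python
import math


def mnl_probabilities(x, betas):
    """Choice probabilities for each non-reference alternative, then the reference.

    `betas` holds one coefficient vector per non-reference alternative; the
    reference alternative has all coefficients fixed at 0, so its utility is 0.
    """
    v = [sum(b * xi for b, xi in zip(beta, x)) for beta in betas] + [0.0]
    denom = sum(math.exp(vi) for vi in v)
    return [math.exp(vi) / denom for vi in v]


# Hypothetical covariates (constant plus one predictor) and coefficients.
x = [1.0, 0.5]
betas = [[0.4, 1.0],    # alternative 1: linear predictor 0.9
         [-0.2, 0.6]]   # alternative 2: linear predictor 0.1
probs = mnl_probabilities(x, betas)

# Under IIA the odds of alternative 1 against the reference depend only on
# alternative 1's own linear predictor.
odds_1_vs_ref = probs[0] / probs[2]
```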


This specification assumes $(\varepsilon^1 - \varepsilon^3)$ and $(\varepsilon^2 - \varepsilon^3)$ are independently distributed (leading to the 
IIA assumption), which can be relaxed by introducing random effects $u^a$ to model the 
correlation between choices, 


$$U^a = V^a + u^a + \varepsilon^a. \tag{5}$$


The differences between the random effects for each category compared to the reference (i.e. 
latent preferences) are denoted as $\eta^a$, 


$$\eta^1 = u^1 - u^3, \qquad \eta^2 = u^2 - u^3. \tag{6}$$


The probability of choosing alternative 1 then becomes 


$$\Pr(\text{choice} = 1) = \iint \frac{\exp(\beta^1 x + \eta^1)}{1 + \exp(\beta^1 x + \eta^1) + \exp(\beta^2 x + \eta^2)}\, dG(\eta^1, \eta^2), \tag{7}$$


for some distribution $G(\eta^1, \eta^2)$ of the latent variables. Commonly the latent variables are 
assumed multivariate normal but we specified the latent variable distribution as a finite number 
of discrete points, i.e. latent classes. 
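
With a discrete latent distribution the integral over the latent variables reduces to a weighted sum over classes. The sketch below illustrates this under that reading; the class probabilities and locations are hypothetical and are not the estimates reported in the tables.

```python
import math


def class_conditional(v1, v2, eta1, eta2):
    """Choice probabilities (alt 1, alt 2, reference) at latent location (eta1, eta2)."""
    e1, e2 = math.exp(v1 + eta1), math.exp(v2 + eta2)
    denom = 1.0 + e1 + e2
    return (e1 / denom, e2 / denom, 1.0 / denom)


def marginal(v1, v2, classes):
    """Sum over classes of class probability times class-conditional choice probability."""
    out = [0.0, 0.0, 0.0]
    for pi_c, eta1, eta2 in classes:
        for a, p in enumerate(class_conditional(v1, v2, eta1, eta2)):
            out[a] += pi_c * p
    return out


# Hypothetical latent classes: (probability, eta1 location, eta2 location).
classes = [(0.5, 2.0, 1.0), (0.2, 0.5, 1.5), (0.3, -2.0, -1.5)]
mixed = marginal(0.0, 0.0, classes)

# A single class located at the origin reproduces the standard multinomial logit.
plain = marginal(0.9, 0.1, [(1.0, 0.0, 0.0)])
```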

The above specification is for first choice. More statistically efficient estimates of preference 
can be obtained from rankings (Hausman & Ruud, 1987), a form of data that fit neatly into the 
MMNL framework using the Luce model for ranked preferences (Luce, 1959). With three al- 
ternatives the first choice probabilities are as for the original model; second choice probabili- 
ties, conditional on the first choice, are given by the same multinomial form, but with the first 
choice excluded from the choice set. For example, the probability that alternative 1 will be 
ranked first, followed by alternative 2 second (with the final choice redundant) is 
$$\Pr(\text{1st choice} = 1,\ \text{2nd choice} = 2) = \iint \frac{\exp(\beta^1 x + \eta^1)}{1 + \exp(\beta^1 x + \eta^1) + \exp(\beta^2 x + \eta^2)} \times \frac{\exp(\beta^2 x + \eta^2)}{1 + \exp(\beta^2 x + \eta^2)}\, dG(\eta^1, \eta^2)$$
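
The sequential structure of the Luce model for rankings can be sketched as below, conditional on a given latent location so that no integration is needed. The function name and utility values are hypothetical; alternative 3 is the reference with utility 0.

```python
import itertools
import math


def ranking_probability(utilities, ranking):
    """Luce-model probability of a complete ranking of the alternatives.

    Each stage is a multinomial logit over the alternatives not yet ranked;
    the last position is determined once the others are fixed.
    """
    remaining = list(ranking)
    prob = 1.0
    for choice in ranking[:-1]:
        denom = sum(math.exp(utilities[a]) for a in remaining)
        prob *= math.exp(utilities[choice]) / denom
        remaining.remove(choice)
    return prob


# Hypothetical utilities (linear predictor plus latent trait) for alternatives 1-3.
utilities = {1: 0.9, 2: 0.1, 3: 0.0}
p_12 = ranking_probability(utilities, (1, 2, 3))

# The probabilities of all six possible rankings sum to one.
total = sum(ranking_probability(utilities, r)
            for r in itertools.permutations(utilities))
```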